WHAT IS CLAIMED IS:

1. A document information search apparatus for
searching document information on the basis of a search
request transmitted through a network and responding,
wherein:

a search condition designating unit which
designates a file as a search condition and transmits
contents of said designated file via the network is
provided for a search requesting source; and

a document search unit which forms a keyword from
the file contents transmitted from said search
condition designating unit and searches similar
documents from a database is provided on a search side.



2. An apparatus according to claim 1, wherein said
search condition designating unit transmits a head file
portion of the designated file contents.

3. An apparatus according to claim 1, wherein said
search condition designating unit allows an HTML file
and an Excel file to be included in the file which is
designated as said search condition.



4. An apparatus according to claim 1, wherein
index information describing a list of important
words extracted from search target documents is stored
every document in said database,
and said document search unit on the search side
comprises:

a text extraction processing unit which extracts a
text document from the file contents received in
response to the search request;

a morpheme analyzing unit which extracts nouns by
a morpheme analysis of said text document;

a keyword forming unit which extracts important
words from said nouns and forms a keyword in which said
important words are coupled by OR; and

a search executing unit which searches similar
documents by searching the search database by said
keyword and notifies the search requesting source of a
search result.
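
The four units recited in claim 4 can be pictured as a small pipeline. The sketch below is illustrative only and is not taken from the specification: the function names, the stop-word list, and the in-memory index layout (document ID mapped to its set of important words) are all assumptions, and a simple tokenizer stands in for a real morpheme analyzer. Every extracted noun is treated as an important word here.

```python
import re
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects the character data of an HTML document, ignoring tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def extract_text(file_contents: str) -> str:
    """Text extraction step: drop the markup, keep the character data."""
    collector = _TextCollector()
    collector.feed(file_contents)
    collector.close()
    return " ".join(collector.parts)


# Assumption: a tiny stop list stands in for part-of-speech filtering.
STOP_WORDS = {"a", "an", "the", "of", "and", "or", "is", "in", "to", "for"}


def extract_nouns(text: str) -> list[str]:
    """Stand-in for morpheme analysis: unique lowercase tokens minus stop words."""
    nouns = []
    for token in re.findall(r"[a-z]+", text.lower()):
        if token not in STOP_WORDS and token not in nouns:
            nouns.append(token)
    return nouns


def form_keyword(important_words: list[str]) -> str:
    """Keyword forming step: couple the important words by OR."""
    return " OR ".join(important_words)


def search(index: dict[str, set[str]], important_words: list[str]) -> list[str]:
    """OR search: return documents containing any word, best matches first."""
    scores = {}
    for doc_id, words in index.items():
        hits = sum(1 for word in important_words if word in words)
        if hits:
            scores[doc_id] = hits
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))
```

Ranking by the number of matching words is a design choice of this sketch; the claim only requires that similar documents be searched by the OR-coupled keyword and that the result be returned to the requesting source.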

5. An apparatus according to claim 4, wherein said
keyword forming unit counts the number of times of
appearance showing in which documents in the index of
each of the search documents stored in said document
database each of said nouns appears, selects a
predetermined number of upper words each having the
number of times of appearance in a predetermined range,
and forms the keyword.

6. An apparatus according to claim 5, wherein in the
case where the number of documents in the index is
assumed to be (N), said keyword forming unit selects
upper ten words each having the number (H) of times of
appearance in a range where 2N/3 > H > 1 and forms the
keyword.
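
The selection rule of claims 5 and 6 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function name and the index layout (document ID mapped to its set of important words) are hypothetical, and "upper" is read here as "highest H first", which the claims do not spell out.

```python
def select_important_words(nouns, index, limit=10):
    """Pick up to `limit` nouns whose document count H satisfies 2N/3 > H > 1.

    nouns: nouns extracted from the file given as the search condition.
    index: dict mapping document ID -> set of important words (assumed layout).
    """
    n_docs = len(index)

    # H for each distinct noun: the number of indexed documents containing it.
    counts = {}
    for noun in dict.fromkeys(nouns):  # de-duplicate, keep first-seen order
        counts[noun] = sum(1 for words in index.values() if noun in words)

    # 2N/3 > H is tested as 3H < 2N to stay in integer arithmetic.
    eligible = [w for w, h in counts.items() if h > 1 and 3 * h < 2 * n_docs]

    # Assumption: "upper" words are those with the largest H (stable sort).
    eligible.sort(key=lambda w: -counts[w])
    return eligible[:limit]
```

With N = 6 indexed documents the admissible range is H = 2 or 3, so a noun found in five documents is rejected as too common and a noun found in only one is rejected as too rare.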

7. An apparatus according to claim 5, wherein said
keyword forming unit allows property information
extracted from the file received in response to the
search request to be included in said keyword, thereby
allowing the similar documents to be searched.

8. An apparatus according to claim 7, wherein said
property information includes a writer of the file
received in response to the search request, a document
title, and the like.
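
One way to picture claims 7 and 8 is to append the property information to the OR-coupled keyword as extra terms. The sketch below is only an assumption-laden illustration: the function name, the property dictionary, and the field-qualified term syntax (for example `writer:"..."`) are hypothetical, and the claims do not state how property terms are combined with the important words.

```python
def form_keyword_with_properties(important_words, properties):
    """Couple important words by OR and append non-empty property terms.

    properties: dict of property name -> value taken from the received file,
    e.g. {"writer": "...", "title": "..."} (hypothetical layout).
    """
    terms = list(important_words)
    for name in ("writer", "title"):
        value = properties.get(name)
        if value:
            # Hypothetical field-qualified term; a real engine may differ.
            terms.append(f'{name}:"{value}"')
    return " OR ".join(terms)
```

A file without usable properties simply yields the plain OR-coupled keyword, so the search still runs on the important words alone.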

9. An apparatus according to claim 1, wherein said
search condition designating unit of said search
requesting source is provided by a WWW browser of a
client, transmits the contents of the file designated
by a search request picture plane of said WWW browser
to a search machine of a WWW server through the
network, and sends said file contents to said document
search unit.

10. A document information search apparatus
comprising:

a database in which index information describing a
list of important words extracted from search target
documents has been stored every document;

a text extraction processing unit which extracts a
text document from contents of a file received in
response to a search request from a network in which a
document file has been designated as a search
condition;

a morpheme analyzing unit which extracts nouns by
a morpheme analysis of said text document;

a keyword forming unit which extracts important
words from said nouns and forms a keyword in which said
important words are coupled by OR; and

a search executing unit which searches similar
documents by searching said database by said keyword
and notifies a requesting source of a search result.

11. An apparatus according to claim 10, wherein said
keyword forming unit counts the number of times of
appearance showing in which documents in the index of
each of the search documents stored in said document
database each of said nouns appears, selects a
predetermined number of upper words each having the
number of times of appearance in a predetermined range,
and forms the keyword.

12. An apparatus according to claim 10, wherein
property information extracted from the search target
documents is stored in said database together with the
index information, said keyword forming unit allows
said property information extracted from the file
received in response to the search request to be
included in said keyword, thereby allowing the similar
documents to be searched.

13. A document information search method of searching
document information on the basis of a search request
transmitted via a network and responding, comprising
the steps of:

storing index information describing a list of
important words extracted from search target documents
every document into a database;

in the case where a file is designated as a search
condition on a search requesting source, transmitting
contents of the designated file to a server together
with the search request through the network; and

on a search side, extracting a text document from
the file contents received in response to the search
request, extracting nouns by a morpheme analysis of
said text document, extracting important words from
said nouns, forming a keyword in which said important
words are coupled by OR, searching similar documents by
searching said database by said keyword, and responding
a search result.

14. A method according to claim 13, wherein as a
formation of said keyword, the number of times of
appearance showing in which documents in the index of
each of the documents stored in said database each of
said nouns appears is counted, a predetermined number
of upper words each having the number of times of
appearance in a predetermined range are selected, and
the keyword is formed.

15. A method according to claim 14, wherein property
information extracted from the file received in
response to the search request is included in said
keyword and the similar documents are searched.

16. A computer-readable recording medium in which a
search program has been stored, wherein said program
comprises the steps of:

receiving a search request in which a document
file has been designated as a search condition;

extracting a text document from contents of the
file received in response to the search request;

extracting nouns by a morpheme analysis of said
text document;

extracting important words from said nouns and
forming a keyword in which said important words are
coupled by OR; and

searching similar documents by searching a
database by said keyword and notifying a requesting
source of a search result.

17. A medium according to claim 16, wherein in the
step which forms the keyword of said search program,
the number of times of appearance showing in which
documents in the index of each of the documents stored
in said database each of said nouns appears is counted,
a predetermined number of upper words each having the
number of times of appearance in a predetermined range
are selected, and the keyword is formed.

18. A medium according to claim 16, wherein said
search program further comprises a step which allows
property information extracted from the file received
in response to the search request to be included in
said keyword and searches the similar documents.



